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Abstract — Consider the estimation of an unknown parameter 
vector in a linear measurement model. Centralized sensor selec- 
tion consists in selecting a set of k s sensor measurements, from 
a total number of m potential measurements. The performance 
of the corresponding selection is measured by the volume of an 
estimation error covariance matrix. In this work, we consider the 
problem of selecting these sensors in a distributed or decentral- 
ized fashion. In particular, we study the case of two leader nodes 
that perform naive decentralized selections. We demonstrate 
that this can degrade the performance severely. Therefore, two 
heuristics based on convex optimization methods are introduced, 
where we first allow one leader to make a selection, and then 
to share a modest amount of information about his selection 
with the remaining node. We will show that both heuristics 
clearly outperform the naive decentralized selection, and achieve 
a performance close to the centralized selection. 

I. Introduction 

Consider a linear model where a centralized collector es- 
timates an n-dimensional parameter vector via an arrange- 
ment of to sensors. The sensor readings are affected by 
measurement noise. The noise samples are assumed to be 
realizations of independent identically distributed Gaussian 
random variables. Now suppose the collector is allowed to 
use k s active sensors only, where n < k s < m. We call 
such a situation a centralized sensor selection problem. The 
performance of a particular selection can be assessed by the 
volume of the estimation error covariance matrix [1 , Sec. II. A]. 
Therefore, the objective of the sensor selection problem is 
to select k s sensors such that this volume is minimized. For 
this purpose, the centralized collector must know the complete 
measurement matrix, which is needed for calculating the error 
covariance. 

In contrast, consider the sensor arrangement as depicted in 
Fig.Q] We have a partition of all sensors into two groups. Each 
sensor group is associated with a specific leader node. The 
decentralized sensor selection problem consists in selecting a 
subset of sensors by the corresponding leader nodes individu- 
ally. After that, the individual selections are transmitted to the 
centralized collector. The main advantage of such an approach 
is that we do not need to know the complete measurement 
matrix at one point, i.e., at the centralized collector. This 
can be motivated, for example, by limitations of the available 
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transmission bandwidth in a sensor network. However, there is 
no guarantee that individual selections minimize the volume of 
the error covariance matrix. The reason is that the decentral- 
ized leader nodes may choose jointly correlated measurements, 
without even knowing it. In this work, we propose two simple 
heuristic methods for decentralized sensor selection. Both 
heuristics try to avoid jointly correlated measurements by 
transmitting a modest amount of data between leader nodes. 
We will show by numerous numerical experiments that the 
performance can be very close to the centralized solution. 

The mathematical form of the sensor selection used in this 
paper was introduced in (TJ, (2). In particular, the authors 
in HI study the sensor selection problem embedded in the 
framework of convex optimization. Throughout this paper, we 
will make extensive use of this approach. In |3|, a multi- 
step sensor selection strategies based on the Kalman filter 
error covariance matrix is investigated. Other authors propose 
single sensor scheduling algorithms, e.g., Q, 0. A different, 
but conceptually related approach is the selection of reliable 
sensors in the context of robust sensing [6]. 

The remainder of the paper is organized as follows. Section 
II introduces the centralized and decentralized sensor selection 
problem. To the best of our knowledge, the latter was not 
considered by others in this form. In Section III, two heuristics 
for decentralized sensor selection are motivated and developed. 
The solution of the (nonconvex) sensor selection problem is 
outlined in Section IV. In Section V, extensive numerical 
simulation illustrates the performance gains of the proposed 
decentralized heuristics. 
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Fig. 1. Arrangement of different sensors and leader nodes. Sensors (•) are 
associated with leader node 1, sensors (*) with leader node 2. Both leader 
nodes may share a (very) limited amount of information. 



II. Sensor Selection 

A. System Model 

A linear measurement model can be written as fl] Sec. II. A] 
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where x E K™ is an unknown parameter vector that we 
want to estimate, y 6 R m is the measurement vector, and 
m > n. Throughout this paper, we will use the terms 
sensor and measurement synonymously. The measurements 
are corrupted by noise V\,...,v m that is independent and 
identically distributed (iid) with Af(0,cr 2 ). The measurement 
matrix 



A = 



(2) 



is assumed to have full column rank, i.e., rank(A) = n. The 
maximum-likelihood estimate of x is then given by 



XML = 




z=l 



(3) 



The covariance matrix of the estimation error x — xml has the 
form 

/ „, \ -1 



E = a 2 (A T Ay=a 2 PTa^ 



(4) 



We measure the quality of the estimation by the volume of 
this matrix. It can be shown HI Sec. II. A] that this measure is 
related to the log-volume of a confidence ellipsoid given by 



1 



logvol(£ ) ~ const. logdet 



(5) 



This volume is a scalar measure for how informative the 
measurements are or how uncertain we have to be about our 
estimate xml- In particular, a small volume corresponds to a 
small uncertainty, and vice versa. 

B. Centralized Sensor Selection Problem 

Now suppose we have a total number of m measurements. 
A central collector attempts to find a subset of fc s < m 
measurements that minimizes the uncertainty about xml- This 
leads to the centralized sensor selection problem that is stated 
in 0] Sec. II.B] as 



maximize f cen (z) = logdet 

z 

subject to 



^ Zicnal 



l T z = k s 
Zi e {0,1}, 



(6) 



i — 1, . . . , m 



where 1 is a vector of appropriate dimension with all entries 
equal to one. Each Boolean variable Z{ corresponds to a 
particular choice of a measurement. Whenever Zi = 1, the ith 
measurement is to be used. The linear constraint 1 T z — k s is 



a budget constraint on the total number of active sensors. On 
occasion, we will rewrite the objective as 



/cenO) = logdet (A T diag(z)A) , 



(7) 



where the matrix A T di&g(z)A is assumed to be positive 
definite Q Ch. 7]. 

Due to the Boolean constraints in (O, the centralized sensor 
selection problem is a nonconvex optimization problem and is 
generally hard to solve. However, note that the objective is 
a concave function for Zi > lU Sec. 3.1.5]. Relaxing the 
Boolean constraints to < z s ; < 1, the relaxed centralized 
sensor selection problem has the form 



maximize fcen(z) 

z 

subject to l T z = fc s 



(8) 



0< Zi < 1, i= 1,.. 



, m 



This problem is a convex optimization problem and hence can 
be solved efficiently, where the solution is denoted as z* en . The 
relaxation gives us a global upper bound for the centralized 
sensor selection problem (O. The bound is given by 



fcen( z cen ), 



(9) 



and is subsequently used as a global performance measure for 
decentralized strategies. 

In order to obtain a feasible solution to © we apply a sim- 
ple rounding scheme as suggested in [1]. In this scheme, the 
elements of z* en are rearranged in descending order. After that, 
the fc s largest elements are set equal to 1, and the remaining 
elements to 0. This gives us a suboptimal solution z cen to ©, 
along with a lower bound L cen = / C en(z C en)- This lower bound 
and the corresponding duality gap A cen = U cen — L cen will be 
used later in order to compare centralized and decentralized 
methods. In particular, when this gap becomes sufficiently 
small, z cen is nearly optimal for problem (0. Note that it 
is possible to apply more sophisticated rounding schemes HI 
Sec. III.E]. Since we focus on the comparison of centralized 
and decentralized strategies, we will use only the prescribed 
simple rounding. 

C. Decentralized Sensor Selection Problem 

Selecting sensors in a centralized fashion, the full mea- 
surement matrix A must be known at one point, e.g., the 
centralized collector. Now suppose we have two leader nodes 
that have access to half of the measurements m/2 > n via the 
partition 



A = 



At 
A 2 



leader node 1: A\, leader node 2: A? 



(10) 

where Ai,A 2 G W n / 2xn , and rank(Ai) = rank^) = n. 
In the decentralized sensor selection problem considered here, 
both leader nodes are only allowed to select fc s /2 sensors each. 
For each leader node I with I £ {1,2}, we first solve the 



relaxed optimization problems 



A. Main Idea 



1/2 

maximize fi(z t ) = logdet I V" z u a u aj t 



(11) 



subject to 1 T zi = fc s /2 

0<« M <1, i = l,...,m/2 

where z* is the optimal solution of (fTTb . We call this approach 
naive decentralized sensor selection or simply decentralized 
sensor selection. After computing z\, both leader nodes apply 
the simple rounding scheme separately in order to obtain the 
selections zi. Finally, both nodes transmit their selections to 
the centralized collector. 

D. Suboptimality and Performance 

The (global) performance of any decentralized method has 
to be judged at the centralized collector based on the full 
problem, i.e., the centralized objective / cen . The global upper 
bound is given by the expression in ®. For calculating the 
lower bound, we first stack the solution vectors 



-de< 



(12) 



and insert them into the centralized objective 

/cen(^dec) = log det 

= logdet (A T diag(z d ec)A) 
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Our measure for any decentralized strategy is then given by 
the suboptimality gap 



^dec — Uct 



L, 



dec- 



(14) 



Furthermore, the set of feasible solutions of (fTTb is a subset 
of (0. Therefore, we can also conclude that / C en(z* en ) > 

/cen(^dec)- 

Interestingly, it is not possible to make general statements 
about the lower bounds L cen and Ldec, respectively. The 
reason is that the prescribed simple rounding scheme produces 
one suboptimal solution for the centralized sensor selection 
problem (0. In principle, it is possible that the rounding 
from the decentralized leader nodes results in a different 
suboptimal solution, which in turn achieves a higher lower 
bound. However, as numerical evaluation suggests (see Sec.lVli 
this effect does not occur very often. 

III. Approach 

In this section, we will introduce two methods for solving 
the centralized sensor selection problem © in a partially 
decentralized manner. Partially decentralized means that we 
are willing to transmit a negligible amount of data from 
one leader node to the other. In particular, we allow the 
transmission of N <C m vectors of dimension R™. Without 
loss of generality, we assume that leader node 1 shares some 
vectors with leader node 2. These methods can be seen as 
simple heuristics that attempt to improve the lower bound Ldec 
and, accordingly, the suboptimality gap ( [Pil l. 



The main idea behind both heuristics can be described 
as follows. Assume we have measurements that are approx- 
imately collinear, i.e., rows from the matrix A are weakly 
correlated. In the case of two rows, it follows from the error 
covariance matrix that 



T 

a k a k 



(1 +7)oj-aJ, 



7 e 



(15) 



In the above expression, we have the sum of two rank-1 
matrices that can be approximately be rewritten as a scaled 
version of one rank-1 matrix. This means, we have (effec- 
tively) lost one rank. Now recall that each binary variable Zi 
corresponds to a specific selection out of all rows from A. 
Since YlltLi Zi a i a J I s symmetric, it is orthogonally diagonal- 
izable and we can rewrite the objective © as 

(m \ n 

= logdet (UAU T ) = ^ log Aj, 
i=l / i=l 

where Xi are the (positive) eigenvalues of A T d'mg(z)A. 
Selecting many weakly correlated measurements via z will 
reduce the effective rank of the above matrix. This increases 
the number of small eigenvalues, and therefore increases the 
total volume of the corresponding confidence ellipsoid. 

Unless we are allowed to use k = rn measurements, the cen- 
tralized sensor selection will avoid correlated measurements 
since they do not add significantly to the objective in ©. 
This situation is different for the decentralized sensor selection 
problem. Both leader nodes maximize their own ellipsoid, 
using only the local data A\ and A2, respectively. However, it 
is possible and likely that their individual selections are jointly 
correlated, which in turn leads to a smaller global ellipsoid. 
Now suppose leader node 1 shares some information about 
the largest contribution to its own ellipsoid. Leader node 2 
would avoid picking the same contribution. This is the main 
idea behind both heuristics. 

To be more specific, suppose only leader node 1 has solved 
the relaxed decentralized sensor selection problem (fTTl i. The 
main contribution to the volume of the local ellipsoid is 
given by the largest eigenvalues of the matrix A^di&g^^Ai. 
Denote Ai, . . . , Ajv the N largest eigenvalues and u\, . . . , Un 
the associated eigenvectors with that matrix. We will now 
introduce two heuristics that capitalize the influence of these 
eigenvalues and -vectors. 

B. Focused Diversity Method 

Leader node 2 modifies its data matrix A2 and selection 
vector Z2 such that 



^4fdm = 



A 2 



Z2 
z m/2+l 

_ z m/2+N, 



(16) 



We can now rewrite objective of the decentralized sensor 
selection problem for leader node 2 as 

logdet (Af dm diag(z)A fdm 

/ m/2 

= logdet 



N 

(I >,(l -2 i + ^2 Zi+m/2K u i u i 
i=l 



(17) 
(18) 



Leader node 2 must avoid the same directions or contributions 
that were already made by leader node 1 . One way to achieve 



this is to set z n 



,/2+l 



= ... = Z„ 



/2+N 



= 1. Basically, leader 



node 1 has already made a decision for the second node. The 
objective for leader node 2 has then the form 



/2,fdm(2 2 ) = logdet 




(19) 



Hence the remaining relaxed optimization problem to be 
solved is given by 

/2,fdm(^2) 

(20) 

.,m/2 

z%. We call this 



maximize 

22 

subject to 



1 T Z 2 = fcs/2 



1, 



2.fdm 



< z 2l < 1, 

where the solution is denoted as z. 
heuristic focused diversity method. Note that the underlying 
maximization problem is still concave. After solving, leader 
node 2 performs the simple rounding scheme in order to obtain 
^2,fdm- The upper and lower bounds for the focused diversity 
method are then calculated based on the vectors 



'fdm 



■ z\ - 








i z fdm — 




7* 

_ 2,fdm_ 




z 2,fdm_ 



(21) 



C. Linear Penalty Method 

Another way to force leader node 2 to avoid the main 
directions from leader node 1 is to introduce a linear penalty 
for choosing similar measurements. This can be accomplished 
by adding an additional term to the objective given in (fTTT >. 
Consider the quantity 



\a 2 i 



02i 2 



2 2i^j u j 



1 



|a 2j 



(22) 



similarity relevance 

where i = 1, . . . , m/2, and j = 1,...,N. Consider the LHS 
first. Whenever a measurement a,2i has a direction that is 
similar to a main direction from leader node 1, the absolute 
value of the normalized inner product between a2i and Uj 
will be Targe'. In order to account for a possibly higher 
contribution from A2, we put an additional weight on this 
similarity, which is called relevance. Note that we only need 
to calculate the product XjUj. Therefore, the RHS of (l22l does 
not violate the restriction that only N vectors from leader node 
1 can be shared. 

Using the quantity d22l we can now construct a penalty 
term. Denote the cost vector 



N 

E 



1 



Mil 



>o, 



l,...,m/2. (23) 



We rewrite the objective for leader node 2 as 



'm/2 



/2,i P m(2 2 ) = logdet ^ Z2ia2ia^ l 



= /2O2) - c T z 2l 



m/2 

CiZ 2 i 



(24) 



(25) 



where c = [c\ ■ ■ ■ c m / 2 ]- The resulting heuristic is called 
linear penalty method, associated with the (relaxed) concave 
optimization problem 

/2,lpm(^2) 



maximize 

22 

subject to 



l T z 2 = k s /2 
0<z 2l < 1. 



(26) 



1. 



,m/2 



The solution is called zJipm : ~ z 2' an d tne resulting vectors 
for calculating the upper and lower bound are given by 



" 4 ' 








: z lpm — 




_ z 2,lpm_ 


z 2,lpm_ 



lpm 



IV. Solving a Sensor Selection Problem 



(27) 



In order to solve a concave maximization problem similar 
to (JHJ, several methods are at hand. One could resort to 
optimization software like CVX (9)- In our case, we have 
implemented a logarithmic barrier method, see (8l Sec. 11.3]. 
A full implementation will be made available on our website. 
The reason for choosing a barrier method is that it can be 
implemented without great effort. However, other interior- 
point methods, which are used in practice more often (for 
example, primal-dual methods), may solve the above problems 
within fewer iterations and higher accuracy. 

From [1, Sec. Ill D], the approximate objective of the 
relaxed sensor selection is given by 



V , cen(^)=log det 



log det 



\ v T 

Z{Q>l(X^ 



ZlCLid^ 



f/s5^(log(«4)+log(l-zi)) 

i=l 

(28) 

+ Mz). (29) 



The authors also give explicit expressions for the gradient 
Vi/'cenl^) and Hessian V 2 ip cen (z). This is needed in order 
to calculate the Newton step in the inner iteration of the 
barrier method . In our case, leader node 2 has a modified 
approximate objective, depending on the method (e.g., focused 
diversity or linear penalty method). For completeness, the 
explicit expressions are given below. 

For the focused diversity method d20l . the approximate 
objective of leader node 2 has the form 



^fdm( z 2)=logdet 



'm/2 N 
= 1 i=l 



where 



pm/2 



<?W2), 

(30) 

Therefore, the gradient is given by 



VV>fdm(2 2 ) = diag (A^W am A 2 ) + V0 K (z 2 ), (31) 



where 



Wt 



fdm 




(32) 



The Hessian is given by 

VVfdm(2 2 ) = - {A^WfinAi) (AlW Sdm A 2 ) + V 2 K (z 2 ), 

(33) 

where is the Hadamard product. 

In the case of the linear penalty method (l26T i. the approxi- 
mate objective reads as 

An/2 \ 
ipi P m(z 2 ) = logdet ^2 Z2ia2ia.2i - c T z 2 + 4> K {z 2 )- (34) 

Hence the gradient has the form 

V^i pm (z 2 ) = diag (A^Wi pm A 2 ) -c + V0 K (z 2 ), (35) 
where 



m/2 



T 



(36) 



Since we have a linear penalty term, the Hessian remains 
unchanged when compared to |1 Sec. Ill D], except the 
different dimension m/2 x m/2. 

V. Numerical Example 

We will now compare the naive decentralized sensor se- 
lection (Sec. III-Cb with the proposed heuristics, e.g., focused 
diversity (Sec. IIII-Bb and linear penalty method (Sec. IIII-Cb . 
Our benchmark for all comparisons is the solution from the 
centralized sensor selection problem (Sec. IH-BI >. 

Throughout this section we investigate the following model- 
ing setup. We use m = 100 measurements, n — 40 unknown 
parameters, and a total number of fc s = 40, ... ,60 sensors. 
The measurement matrix is partitioned as given in ( fTOb . The 
entries of the submatrices A\ and A 2 are iid with A/"(0, 1). 
In order to create weakly correlated measurements, we pick 
randomly two different rows from A\ and A 2 , say row i from 
A\ and row j from A 2 . Both rows are then modified as follows 



l£ = \J\ - <T 2 -b T + aw 
= yl — cr 2 • b T + aw 



l 2j 



T 
i ' 

T 
3 ' 



(37) 
(38) 

where b\, . . . , b n , wn , . . . , Wi n , and Wji , . . . , Wj n are iid with 
Af(0, 1). In this formulation, a represents of the strength of 
correlation. In our simulation we modified a total number of 
30 rows and used a = 0.1. 

We first fix the number of shared vectors N = 5, and vary 
only the number of sensors to be used. In Fig. |2(a)| the lower 
bounds L are shown for this case. We can clearly see that the 
focused diversity and the linear penalty method outperform 
the naive decentralized selection, notably for a low number of 
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number of sensors fc 

(b) relative suboptimality gap A re i j 

Fig. 2. Lower bounds (a) and relative suboptimality gap (b) for the four 
different sensor selection strategies, where N = 5. Note that the centralized 
sensor selection serves as a benchmark for all decentralized strategies. 



sensors. Since this gives only a lower bound we introduce the 
relative suboptimality gap 

K^cen ~ Li 



100- 



\u a 



i e {cen, dec, fdm, lpm}. 

(39) 



This gap measures how far we are away from the optimum of 
the (nonconvex) centralized sensor selection problem ©. Note 
that the relative suboptimality gap is a worst-case measure. 
This means in practice we are often closer to this optimum 
than suggested by this gap. The corresponding results are de- 
picted in Fig. |2(b)| When compared to the naive decentralized 
selection, a considerable performance gain for both heuristics 
can be observed. 

As mentioned in Sec. III-DI it is in principle not clear 
how the simple rounding scheme affects the different lower 
bounds, and hence the relative suboptimality gap. In order to 
get a meaningful interpretation, we run a simulation with 10 4 
different random realizations of the aforementioned modeling 
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Fig. 3. Relative suboptimality gap for the four different sensor selection 
strategies, where k s = 40 and TV = 5. 



setup. The results for k s = 40 and N = 5 can be seen 
in Fig. [3] As shown by the histograms, the linear penalty 
method performs on average slightly better than the focused 
diversity method. It is also evident that the decentralized sensor 
selection is far away from being optimal. We have also plotted 
the empirical means of all relative suboptimality gaps. 

The influence of the number of shared vectors on the relative 
suboptimality gap is illustrated in Fig. [4] We assume that all 
vectors are transmitted perfectly, i.e., no transmission errors 
occur. Again, we have used total number of 10 4 realizations 
and fc s = 40 sensors. Each point in Fig. |4] corresponds to 
a mean fii(N). As a reasonable deviation measure, we have 
included standard deviations depicted as error bars around the 
corresponding means. Note that we can not make the gap 
between the centralized selection and decentralized heuristics 
arbitrarily small. The reason is that leader node 1 already made 
a decision, which in turn can be suboptimal from a centralized 
point of view. 
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Fig. 4. Influence of the number of shared vectors N on the empirical mean 
of the relative suboptimality gap, where k s = 40. The error bars around the 
mean values represent the corresponding standard deviations. 



VI. Conclusion 

We have introduced the decentralized sensor selection prob- 
lem for a partition of m sensors between two leader nodes. A 
naive decentralized solution solves the sensor selection prob- 
lem separately for both leader nodes. It was shown, that this 
solution is considerably worse when compared to a centralized 
approach. Therefore, we proposed two simple heuristics that 
allowed a very limited transmission of information from one 
leader to the other. These heuristics try to avoid similar 
contributions from both leader nodes, or equivalently, to ex- 
ploit diversity in possible sensor selections. As suggested by 
extensive numerical simulations, the heuristics outperformed 
the naive decentralized selection by a substantial margin. 
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